Improved Text-Independent Speaker Identification using Fused MFCC and IMFCC Feature Sets based on Gaussian Filter
Abstract
A state of the art Speaker Identification (SI) system requires a robust feature extraction unit followed by a speaker modeling scheme for generalized representation of these features. Over the years, Mel-Frequency Cepstral Coefficients (MFCC), modeled on the human auditory system, have been used as a standard acoustic feature set for speech-related applications. In a recent contribution, the authors showed that the Inverted Mel-Frequency Cepstral Coefficients (IMFCC) are a useful feature set for SI because they capture complementary information present in the high-frequency region. This paper introduces a Gaussian-shaped filter (GF) in place of the typical triangular-shaped bins when calculating MFCC and IMFCC. The objective is to introduce a higher amount of correlation between subband outputs. The performance of both MFCC and IMFCC improves with GF over the conventional triangular filter (TF) based implementation, individually as well as in combination. With a Gaussian mixture model (GMM) as the speaker modeling paradigm, the performance of the proposed GF-based MFCC and IMFCC, in individual and fused mode, has been verified on two standard databases, YOHO (microphone speech) and POLYCOST (telephone speech), each of which has more than 130 speakers.
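As a rough illustration of the filter-shape change described in the abstract, the sketch below builds a triangular mel filter bank, a Gaussian-shaped counterpart, and an inverted bank. It is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not give the Gaussian width, so the `alpha` parameter and the rule `sigma = (upper edge - centre) / alpha` are hypothetical, as are all function names. The inverted bank is assumed to be the original bank flipped in both filter order and frequency, in line with the "flipped filter banks" wording used for the related work listed below.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def filter_edges(n_filters, sample_rate):
    """n_filters + 2 boundary frequencies (Hz), equally spaced on the mel scale."""
    mel_points = np.linspace(0.0, hz_to_mel(sample_rate / 2.0), n_filters + 2)
    return mel_to_hz(mel_points)

def triangular_bank(n_filters, n_fft, sample_rate):
    """Conventional triangular filters (TF): zero outside [lo, hi]."""
    freqs = np.linspace(0.0, sample_rate / 2.0, n_fft // 2 + 1)
    edges = filter_edges(n_filters, sample_rate)
    bank = np.zeros((n_filters, freqs.size))
    for i in range(n_filters):
        lo, centre, hi = edges[i], edges[i + 1], edges[i + 2]
        rising = (freqs - lo) / (centre - lo)
        falling = (hi - freqs) / (hi - centre)
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

def gaussian_bank(n_filters, n_fft, sample_rate, alpha=2.0):
    """Gaussian-shaped filters (GF) on the same centres.

    ASSUMPTION: sigma = (hi - centre) / alpha; the source does not state
    how the width is chosen.
    """
    freqs = np.linspace(0.0, sample_rate / 2.0, n_fft // 2 + 1)
    edges = filter_edges(n_filters, sample_rate)
    bank = np.zeros((n_filters, freqs.size))
    for i in range(n_filters):
        centre, hi = edges[i + 1], edges[i + 2]
        sigma = (hi - centre) / alpha
        bank[i] = np.exp(-0.5 * ((freqs - centre) / sigma) ** 2)
    return bank

def invert_bank(bank):
    """ASSUMPTION: inverted bank = flip filter order and frequency axis,
    so the narrow filters end up in the high-frequency region."""
    return bank[::-1, ::-1]

if __name__ == "__main__":
    tri = triangular_bank(20, 512, 8000)
    gau = gaussian_bank(20, 512, 8000)
    # Overlap between neighbouring filters, summed over frequency bins.
    print("TF adjacent overlap:", (tri[:-1] * tri[1:]).sum())
    print("GF adjacent overlap:", (gau[:-1] * gau[1:]).sum())
```

Because a Gaussian never drops exactly to zero at the neighbouring centres, adjacent filters share more of the spectrum than their triangular counterparts, which is one way to read the abstract's stated goal of higher correlation between subband outputs.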
Similar resources
Improved Text-Independent Speaker Identification using Fused MFCC and IMFCC Feature Sets based on Gaussian Filter
A state of the art Speaker Identification (SI) system requires a robust feature extraction unit followed by a speaker modeling scheme for generalized representation of these features. Over the years, Mel-Frequency Cepstral Coefficients (MFCC) modeled on the human auditory system has been used as a standard acoustic feature set for speech related applications. On a recent contribution by authors...
Improved Closed Set Text-Independent Speaker Identification by Combining MFCC with Evidence from Flipped Filter Banks
A state of the art Speaker Identification (SI) system requires a robust feature extraction unit followed by a speaker modeling scheme for generalized representation of these features. Over the years, Mel-Frequency Cepstral Coefficients (MFCC) modeled on the human auditory system has been used as a standard acoustic feature set for SI applications. However, due to the structure of its filter ban...
Language and Text-Independent Speaker Identification System Using GMM
This paper motivates the use of Dynamic Mel-Frequency Cepstral Coefficient (DMFCC) feature and combination of DMFCC and MFCC features for robust language and text-independent speaker identification. MFCC feature, modeled on the human auditory system has been the widely used feature for speaker recognition because of its less vulnerability to noise perturbation and little session variability. Bu...
Inverted Mel Feature Set based Text-Independent Speaker Identification using Finite Doubly Truncated Gaussian Mixture Model
This paper provides an efficient approach for text-independent speaker identification using the Inverted Mel-frequency Cepstral Coefficients as feature set and Finite Doubly Truncated Gaussian Mixture as Model (FDTGMM). Over the years, Mel-Frequency Cepstral Coefficients (MFCC), modeled on the human auditory system, has been used as a standard acoustic feature set for speech related application...
New Filter Structure based on Admissible Wavelet Packet Transform for Text-Independent Speaker Identification
Identical acoustic features like Mel Frequency Cepstral Coefficients (MFCC) and Linear Predictive Cepstral Coefficients (LPCC) are widely used for different tasks such as speech recognition and speaker recognition, even though the requirements of speaker recognition differ from those of speech recognition. In MFCC feature representation, the Mel frequency scale is used to get a high resolutio...